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The potential of gravitational lenses for providing direct, physical measurements of the Hubble 
constant, free from systematic errors associated with the traditional distance ladder, has long 
been recognized. However, it is only recently that there has been a convincing measurement 
of a time delay sufficiently accurate to carry out this program. By itself, an accurate time 
delay measurement does not produce an equivalently definite Hubble constant and the errors 
associated with models of the primary lens, propagation through the potential fluctuations 
produced by the large-scale structure and the global geometry of the universe must also be 
taken into account. The prospects for measuring several more time delays and the feasibility of 
making the corresponding estimates of the Hubble constant with total error smaller than ten 
percent are critically assessed. 



1. Introduction 

Ever since the prescient work of Refsdal (1964, 1966), extragalactic astronomers have 
known that a determination of the time delay in the variation of a multiply imaged 
quasar could produce a measurement of the size and age of the universe. The first 
definite example of multiple imaging was discovered in the "double quasar" 0957+56lA,B 
(Walsh, Carswell & Weymann 1979), and, as it seemed that this dream might be realized, 
several groups began monitoring it in the hope of carrying out this program]]] It took 17 
years for a universally accepted time delay to be measured in this system (Kundic 1996), 
after a long and controversial series of papers on the subject (for a review, see Haarsma 
et al. 1996). The effort in constraining the models of the lensing mass distribution has 
paralleled the time delay observations, resulting in a robust model of the system (Grogin 
& Narayan 1996). Many other promising lenses have been discovered since, and there 
is now considerable optimism that Refsdal's technique, like many others discussed at 
this meeting, is ripe for exploitation. In this brief review, we will try to summarize 
recent developments in the study of gravitational lensing insofar as they are relevant to 
measurement of the Hubble constant. For more detail on the history and the basic theory, 
the reader is referred to Blandford & Narayan (1992); Schneider, Ehlers & Falco (1992); 
and Narayan & Bartelmann (1996). Recent observational and theoretical developments 
are presented in the 173rd IAU Symposium proceedings (Kochanek & Hewitt 1996). 



2. The method 

Gravitational lensing relies upon the the propensity of light to follow null geodesies in 
curved spacetime. This means that if we try to fit the round peg that is the curved space 
around a mass M into the square hole that is the pre-relativity framework of Euclid, 
Newton, and Kant, then light rays will appear to be deflected through an angle 
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t In fact, they were doing this under false pretenses because the original estimate of the time 
delay was five times too large, due to an uncharacteristic slip in Young et al. (1980). 
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where b is the impact parameter. Now, an elementary calculation of this deflection angle, 
analogous to the calculation of the deflection of an ultrarelativistic electron passing by an 
atomic nucleus, gives precisely half this deflection. When one analyzes the general rela- 
tivistic calculation, one finds that the other half of the deflection is directly attributable 
to the space curvature and that this is a peculiar prediction of general relativity. This 
prediction has been verified with a relative accuracy of ~ 0.001 in a measurement of the 
solar deflection (Lebach et al. 1985) and we can take this part of the theory for granted. 

It turns out to be quite useful (and rigorously defensible), to use the Newtonian frame- 
work to think in terms of this "deflection" and to treat space as flat but endowed with 
an artificial refractive index 

2$ 

R=l--=-. (2.2) 

where $(r) is the conventional Newtonian gravitational potential which — > as r — > oo 
(Eddington 1919). (As $ < 0, the refractive index exceeds unity and rays are deflected 
toward potential wells.) This also implies that when photons travel along a ray, they will 
appear to travel slower than in vacuo and will take an extra time Ai grav to pass by a 
massive object, where 

Ai grav = ~ J ds $ (2.3) 

and where the integral is performed along the ray. (This effect is also known as the 
"Shapiro delay" and it has been measured with a fractional accuracy ~ 0.002 in the 
solar system by Reasenberg et al. 1979). If the deflector is at a cosmological distance (at 
a redshift Zd), the gravitational delay measured by the observer will be (1 + Zd) times 
longer, where the expansion factor takes into account the lengthening of time interval 
proportional to the lengthening of wave periods. 

There is a second, geometrical contribution to the total time delay. In order to compute 
this contribution, it is necessary to take account of the fact that the global geometry of 
the universe is not necessarily flat. Fortunately, this can be done by defining an angular 
diameter distance (e.g. Weinberg 1972), which is the ratio of the proper size of a small 
source at the time of emission to the angle that it subtends at a distant observer. (In 
computing the angular diameter distance, it is necessary to allow for the fact that the 
universe expands as the light propagates from the source to the observer.) Now define 
angular diameter distances from the observer to the deflector and the source by Dd, D s 
respectively and from the deflector to the source by Dd s - If we compare the true deflected 
ray with the unperturbed ray in the absence of the deflector, then elementary geometry 
tells us that the separation of the two rays at the deflector is given by £ = DdDd s oc/ D s 
(Fig. Q). Now imagine two waves, one emanating from the source at the time of emission, 
the other emanating from the observer backward in time leaving now and let these two 
waves meet tangentially at the deflector along the undeflected ray. The extra geometrical 
path is simply the separation of these wavefronts at the deflector along the deflected ray, 
a distance £ from the undeflected ray. As the rays are normal to the wavefronts at the 
deflector, we see that the geometrical path difference at the deflector is just £ • a/2. 
Again, we must multiply by (1 + 2<j). The net result is an expression for the geometrical 
time delay 

Ai gcom = (l + z d )^^a 2 . (2.4) 

The total time delay is the sum of the gravitational and the geometrical contributions. 

Both the gravitational and the geometrical time delays are of comparable magnitude 
~ a 2 /H for cosmologically distant sources. For typical deflections ~ 1", this would 
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Figure 1. Illustration of the geometrical time delay in a simplified lensing geometry. The 
two wavefronts shown in the figure represent one wave emanating from the source and the other 
wave emanating from the observer backward in time. They meet tangentially at the deflector 
along the undeflected ray. The separation between these two wavefronts along the deflected ray 
is given by the expression £ • a/2, regardless of the space curvature. 



lead to an estimate At ~ 1 yr. Curiously, for many of the sources in which we are 
most interested, it turns out that the actual delays are much smaller than this estimate 
(by up to three orders of magnitude). This is because the gravitational and geometrical 
time delays tend to cancel each other out and because we tend to select observationally 
highly magnified examples of gravitational lensing in which the image arrangement is 
quite symmetrical. 



In order to evaluate Eqs. 2.3 and 2.4 in a given system, we must construct a model 
of the deflector. For a general mass distribution, the deflection angle, (which can be 
regarded as a two dimensional vector as long as it is small), is given by 

& = J ds n x (fix V$) (2.5) 

where n is a unit vector along the ray. We can now use this deflection to solve for the 
rays. Let us measure the angular position of the undeflected ray as seen by the observer, 
relative to an arbitrary origin on the sky, by and the position of the deflected ray by 
9 (Fig. [I]) . These angles must satisfy the general lens equation 

= 0- a{6) (2.6) 

where 



a is the reduced deflection angle. When Eq. 2.t has more than one solution, we have a 



strong gravitational lens producing multiple images. Common strong lenses form two or 
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four images. We can use use Eq. 2.6 to write the total time delay in the form 

\e-pf 



At = K 



where 



and 



K = (1 + z A ) 



(2.8) 



(2.9) 



D d D s c 2 

is the scaled surface potential which satisfies the two-dimensional Poisson equation 



(2.10) 



V 2 * = 



2£(0) 



(2.11) 



and where 



is the reduced deflection angle. 



V* = a (2.12) 
S is the surface density in the lens plane and S c = 



c D s / AnGDfisDd is the so-called critical density. The derivatives are performed with 
respect to 0. 

Next, we must calculate the observed magnification. Suppose that we have a small but 
finite source that is resolved by the observer. In the absence of the deflector, the source 
will appear at position (3 on the sky. After deflection, the j3 plane will be mapped onto 
the 9 plane. The Hessian tensor 

dpi 



H ij - -QQ - - S ij - <1> <.- 



(2.13) 



relates the source to the image. (Note that, as the deflection is itself the gradient of a 
potential, this tensor is symmetric and only has three independent components.) It is 
usual to relate the image to the source and this requires the magnification tensor which 
is the inverse of Eq. 2.13: 

d0 t 



fJ'ij 



d[3i 



(2.14) 



This magnification tensor can be decomposed into an isotropic expansion and a trace- free 
pure shear. As it is symmetric, there is no rotation. 

Now the usual scalar magnification, denoted by /i, is the ratio of the flux observed 
from an unresolved source seen through the deflector to the flux that would have been 
measured in the absence of the deflector. As the intensity is unchanged by the deflector, 
this ratio is simply the ratio of the solid angles subtended by the ratio of the flux with 
and without lensing, given by the Jacobian 



89, 



(2.15) 



These magnifications are not directly observable. Rather, it is the ratio of the magni- 
fications of separate images of the same source that one measures. (Of course this may 
have to be done at different times of observation if the source varies so as to make the 
comparison at the same time of emission.) Similarly, if we are able to resolve angular 
structure in multiple images of a compact source, for example using VLBI, then we can 
also measure the relative magnification tensor relating two images, A, B 



u AB 



(2.16) 
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This tensor need not be symmetric. 

The procedure for estimating the Hubble constant then consists of using the observed 
positions and magnifications of multiple images of the same source to construct a model 
of the imaging geometry which allows us to deduce (3 for the sources and \& for all the 
images. The total time delay for each image can then be computed in the model up 



to a multiplicative, redshift-dependent factor K given by Eq. 2.9, which is inversely 



proportional to Hq. If the time lags between the variation of two (or more) images can 
be measured, it is then possible to get an estimate of H . 



3. Measuring time delays 

Consider a variable source which produces a variation that can be observed using more 
than one ray. As the travel time along these rays differs, the corresponding images will 
vary in brightness at different observing times. If the source is continuously variable, 
then we should be able to measure the time delay using cross-correlation techniques. 
The precision with which it can be measured depends on the amplitude and timescale of 
the source variability, on the frequency of observations and photometric accuracy, and 
on the value of the time delay itself. 

Let us consider optical observations of quasars. Assuming that quasar variability is a 
stationary process, it can be described in terms of the first-order structure function: 

V(r) = ([ m (t + r)-m(t)} 2 ) , (3.17) 

where m(t), m(t + r) are quasar magnitudes recorded at two epochs separated by an 
interval r (Simonetti, Cordes & Heeschen 1985). The qualifier "stationary" refers to the 
assumption that V is not a function of the observation epoch t, but only of the time lag 
r between two points on the light curve. In practice, V(r) can be well approximated by 
a power law for time delays between a few days and a few years: 

[V{t)] x ' 2 -0.015 mag (r/day) ' 35 , (3.18) 

where the numerical coefficients were estimated from Press, Rybicki & Hewitt (1992a); 
Cristiani et al. (1996); and from the light curves of several lenses monitored at the Apache 
Point Observatory. The amplitude of V(t) depends on the bandpass and it is larger at 
bluer wavelengths. 



Equation 3.18 illustrates the difficulties associated with measuring short time delays. 
If the predicted delay is a few days, the expected amplitude of quasar variations is 
only — 0.02 magnitudes, requiring millimagnitude photometry in systems that are often 
complicated and marginally resolved from the ground (In quadruple lenses, one has to 
resolve four quasar images in a — 1" radius, superimposed on the diffuse light of the 
lensing galaxy). Even in a resolved system with a much longer time delay, the double 
quasar 0957+561, there has been much debate about the correct value of the delay, 
with estimates ranging from — 540 days (Lehar et al. 1992; Press et al. 1992a, 1992b; 
Beskin & Oknyanskij 1995) to - 420 days (Vanderriest et al. 1989; Schild & Cholfin 
1986; Schild & Thomson 1995; Pelt et al. 1994, 1996). Optical data recently acquired at 
the Apache Point Observatory by Kundic et al. (1995, 1996) unambiguously measure a 
delay of At = 417 ± 3 days, vindicating the second group. This result required — 100 
flux measurements in two observing seasons with a median photometric error of < 1%. 

A less accurate time delay of 12±3 days has been reported for the radio ring B0218+357 
on the basis of radio polarization measurements (Corbett et al. 1996). The advantage 
of the polarization method over direct photometry is that only one parameter (time 
delay) is needed to align light curves of two images. Alignment of photometric light 
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curves requires an additional parameter, corresponding to the relative magnification of 
the two images. More recently, Schechter et al. (1996) reported a measurement of two 
time delays in the quadruple quasar PG1115+080: 23.7 ± 3.4 days between images B 
and C and 9.4 days between images C and A. Unfortunately, it is not yet possible to 
model PG1 115+080 accurately and so this impressive observation cannot furnish a useful 
estimate of the Hubble constant at this time. Sources like B1422+231 (Hjorth et al. 1996) 
and PKS1830-211 (van Ommen et al. 1995) are also currently being monitored. 



4. Modeling strong galaxy lenses 

4.1. The models 

Any estimate of Hq derived from gravitational lensing will be no better than the lens 
model that is deduced from observations of the positions and fluxes of the individual 
images. Lens modeling is, necessarily, a rather subjective business. The procedure that 
has to be followed depends upon whether or not the images are resolved. In most cases, 
there are N unresolved point images giving 2(N — 1) relative coordinates. There are 
also (N — 1) relative fluxes which can be fit to the magnification ratios. Since sources 
are variable, fluxes must be compared at the same emission time. (A complication 
that can arise at optical wavelengths is that individual stars in the lensing galaxy can 
cause additional, variable magnification, called microlensing, if the source is sufficiently 
compact. However, this is unlikely to be a problem at infrared or radio wavelengths.) 
So, with N point images, there are usually 3(N — 1) observables which can be used in 
model fitting. Once the relative time delays in the system are measured, they provide 
(N — 2) additional constraints. 

Resolved images contain more information. Firstly, when compact radio sources are 
resolved using VLBI, the individ ual im ages ought to be related by a simple, four param- 



eter, linear transformation (Eq. 2.16 ). This can be measured and is more useful than 
the flux ratio alone in constraining the model, and would give an additional 3(N — 1) 
observables. Secondly, if the radio structure is large enough, it is possible to expand to 
one higher order and measure the gradient of fiij . Thirdly, some sources are so extended 
that they form ring-like images. This can happen at either radio or optical wavelengths. 
In this case we have a large number, perhaps thousands of independent pixels to match 
up. In principle, this can lead to a highly constrained lens (and source) model. The 
best techniques for tackling this problem at radio wavelengths (Wallington, Kochanek 
& Narayan 1996) incorporate the model fitting as a stage in the map making procedure 
and, in some cases this can lead to impressively good models. 

In order to model a simple gravitational lens, the scaled potential \& of the lensing 
galaxy, (plus any additional galaxies close enough to contribute to the deflection) is 
modeled using a simple function that contains adjustable parameters that measure the 
depth of the potential, its core radius, its ellipticity and its radial variation. A convenient 
functional form is the elliptical potential 

= b[s 2 + (1 - lc )x 2 - 2 ls xy + (1 + 7c )y 2 ]« (4.19) 

where = (x, y) measures distance from the center of the potential which is located at 
(xo,yo). The two parameters 7 c ,7s combine to describe the ellipticity and the position 
angle of the major axis. The exponent q measures the radial variation of the density at 
large radius. A value q = 0.5 corresponds to an isothermal sphere and a value q = 1 
has asymptotically S oc r~ 2 as might be appropriate if, for example, mass traces light 
as in a Hubble profile. The elliptical potential has the advantage that it is quick to 
compute and good for searches in multi-dimensional parameter space. Its drawback is 
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that the corresponding surface density is peanut-shaped at large ellipticity. Either two 
potentials must be superposed to generate realistic surface density distributions or a 
choice made from a smorgasbord of more complicated potentials (e.g. Schneider et al. 
1992). In some cases, these potentials can be located and oriented using the observed 
galaxy image. However, we know that the mass in galaxies does not completely trace the 
light; in particular it diminishes more slowly with increasing radius. Therefore, we still 
have some freedom in modeling the mass distribution of lensing galaxies, even if their 
light profiles are well known. An example of the elliptical potential model is shown in 
Fig. [| Motivated by the optical HST images of 1608+656, two elliptical potentials are 
superimposed to produce the observed quad configuration. 

It is also conventional to augment the lensing galaxy potential with a simple quadratic 
form, 

* q (9) = -T c (x 2 - v 2 ) - 2T s xy , (4.20) 

which is trace-free and corresponds to a pure shear of the images. The external shear 
has two origins: dark matter lying in the lens plane associated with galaxies outside the 
strong lensing region and large-scale structure along the line of sight, as we discuss in 
more detail below. 

A common procedure for modeling well observed gravitational lenses is to decide upon 
a list of independent observables and a smaller number of lens model parameters. These 
parameters are then varied so as to minimize some measure of the goodness of fit, typ- 
ically a x 2 associated with the differences in the observables as predicted by the model 
from those actually measured. This approach has the merit of rewarding simplicity. How- 
ever, the fit is usually somewhat imperfect and it is difficult to assign a formal error. A 
somewhat better approach, that is much harder to implement in practice, is to construct 
families of more elaborate models that may contain more parameters than observables 
and which are consequently underdetermined. Using this approach, it ought to be possi- 
ble to derive many models that exactly recover the observables. We can then assign two 
types of error to the model time delay. One is associated with the measurement errors 
in the observables, the other depends upon the freedom in the models and it is here that 
the outcome depends most subjectively upon what we are prepared to countenance. 

4.2. Modeling "quad" sources 

Let us describe some of the dangers and possibilities involved in model fitting by consider- 
ing image formation in a simple case, the so-called "quad" sources. These are quadruply 
imaged point sources formed by a galaxy for which the circular symmetry is broken by 
an elliptical perturbation. (Depending upon the nature of the potential in the galaxy 
nucleus, there may also be a fifth, faint, central image which we shall ignore.) For il- 
lustration, we use the simplest possible model of a singular isothermal sphere potential 
perturbed by a quadratic shear, and work to first order in the ellipticity (c/ Blandford & 
Kovner 1988). 

We write the potential in the form 

W s (0) = 9 9 - ie6» 2 cos 2</> , (4.21) 

where polar coordinates 9 — (9, </>) refer to the center of the potential located at the 
center of the observed lensing galaxy (Fig. |J). The first term ty° s = 9$9 describes an 
isothermal sphere (E oc _1 ), and is circularly symmetric. The second term is the non- 
circular perturbation. We assume that we know its orientation from the shape of the 
galaxy but treat the ellipticity as unknown. If we set e = and place a source on the 
axis, then the image will be a circular ring of radius 9q known as the Einstein ring. We 
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Figure 2. Left: Radio map of the gravitational lens system 1608+656. The system was 
observed on 1996 October 10 with the VLA at 8.4 GHz. The map shows four unresolved 
images of the background source (at z B = 1.39). The brightest image (A) is located at the 
origin. (Courtesy of Chris Fassnacht.) Right: A simple model for 1608+656, consisting of two 
elliptical potentials. Solid lines represent logarithmically spaced density contours of the lensing 
mass distribution. The observed image positions (marked with dots) and their relative fluxes 
are accurately reproduced by this model. 

now switch on the perturbation and solve for the source position to linear order assuming 
that e <C 1. We find that the source position is 

[3 = [5 + ee cos24>]s-e6 sm2<t>t , (4.22) 

where S = 8 — 9o, and s, t are unit vectors in the radial and tangential directions respec- 
tively. We can now evaluate the Hessian and find that its eigenvalues are 

hi = 1 h 2 = — - e cos 2(f) , (4.23) 

and the principal axes are rotated with respect to s, t by an angle 

ip = -e sin 20 . (4.24) 

These eigenvalues are the reciprocals of the quasi-radial and quasi-tangential magnifica- 
tions, so that the total magnification is fi = (/11/12) _1 = (<V#o ~ ecos20) . The locus 
of the infinite magnification on the sky, the "critical curve" is then given by h 2 — or 

8 e = e6 cos2<j> . (4.25) 

The critical curve is the image of the "caustic" , the locus in the source plane of points 
that arc infinitely magnified. Its equation is 

(3c = \ficx, Pcy] = 2e0 o [cos 3 (j), - sin 3 0] , (4.26) 

which is the parametric equation of an astroid. If we know the source position, j3, and it 
is located within the astroid then there will be four images located close to the critical 
curve (Fig. ||). If the source lies outside the astroid, there will be two images. 

This model has two free parameters, e and 8q, and as we may have eleven observables 
in a four-image lens (three pairs of relative positions + three flux ratios + two time delay 
ratios), there are many internal consistency checks. For example, the angles of the four 
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Figure 3. Lensing geometry giving rise to a four-image ("quad") configuration. If the 
source is located within the astroid (diamond) caustic produced by an elliptical potential (right 
panel), there will be four tangentially elongated images near the critical curve (solid line) in the 
image plane (left panel). Dotted line in the left panel corresponds to the Einstein ring of the 
unperturbed, circularly symmetric lens. 



4 

y^COS( 



images must satisfy 

4 

= -4jJcos^ . (4.27) 

Now, in order to trust the model, we contend that it is necessary that every observable 
be fit within its measurement error, or there should be a good reason why it can be 
excused. For example, we might have a spectroscopic indication that microlensing is at 
work in one of the images and then we would discount the measurement of its flux. 

Now, it is extremely unlikely that any lens will ever be found that is this simple. So, 
after we are sure that there is inconsistency, we must introduce additional parameters 
or change the model. Following the above approach of expanding about the Einstein 
ring, we can (or may have to) allow the center of the potential to vary, adjust the 
orientation of the perturbations, allow the ellipticity e to vary with radius 8, introduce 
higher harmonics (e.g. oc cos 40 for box-like equipotentials) and so on. Alternatively, 
we can turn to a density-based model. The nature and quality of the observations will 
usually dictate this choice. 

What we discover when we carry out this procedure is that the allowed range of 
observables is quite sensitive not just to the parameters, but to the functional form of 
the model. For example, constraint 4.27 is quite general but can be violated if (not 
unreasonably) we were to add a cos 40 term. In addition, the shape of the unperturbed, 
circular potential H>® controls the quasi-radial magnification and changing it changes hi 
to 1 — d 2 ^ a s /d6 2 . The ratios of time lags are fairly well fixed by the observed image 
magnifications and location; however, the absolute values (in which we are primarily 
interested) are extremely sensitive to radial variation of the ellipticity and this can be 
measured if we have very accurate image positions or an extended source. Finally, we 
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should probably always include the quadratic form Eq. 4.20 to take account of distant 
mass. 

There are further possible consistency checks. For example, if we can make VLBI 
observations of the individual image components, as should be possible for 1608+656 
(Myers et al. 1995, Fassnacht et al. 1996; Fig. ||), and resolve structure in the radial 
direction, we can also check both the radial eigenvalue h\, and the orientation angle if> 
for each of the images. Even better, if we have more than one source, as appears to be 
the case in 1933+503 (Sykes et al. 1996), than we have two independent opportunities 
to test the model and derive the parameters. (This approach may also be of interest for 
cluster arcs.) 

Let us use the quad H1413+117 for illustration and try to model it using a simple 



potential (Eq. 4.21). We can guess the lens center pretty accurately, b ut no t the position 



angle of the major axis of the elliptical perturbation, 4>o- Using Eq. }4.27| , we find four 
choices for </>o and we examine each of these in turn to try to find a consistent solution 
for the source position /3 by varying e, 9q, and the lens center. We find that none 
of our choices of </>o can accomplish this, although one comes close. However, if we 
now compute the magnification ratios, we find that they are quite inconsistent with 
the observations. Therefore we can assuredly reject our lens model. It turns out that 
it is possible to reproduce the positions and fluxes using an elliptical potential with 
parameters {&, s, q, j c , j s , x c , y c } offset by an external shear with parameters r c , T s . 
Although this seems the most plausible model, it is not unique (Yun et al. 1996, preprint) 
and this would be a concern were this a good choice for monitoring. Unfortunately, it is 
not as the predicted time delays are almost certainly too small. 

4.3. Uncertainties in the models 

An important question that we must now address is "How do we assign a formal error 
to the predicted model on the basis of a deflector model?" One procedure is to define a 
X 2 using careful values for the measurement errors and then choose parameters pi that 
minimize this statistic. We then compute a covariance matrix 

dpidpj 

and invert and diagonalize this quantity so as to form the error ellipsoid. We next 
compute the maximum fractional change in the time delay within this error ellipsoid and 
quote this as the fractional error in the derived Hubble constant varying the parameters 
independently until X l v increases by unity. The maximum change in the derived value 
of the delay as we go through this procedure is the required error in the delay. 

However, this is still not enough because existence does not imply uniqueness. Suppose 
that we have a model that truly reproduces all the observables giving an acceptable value 
for x 2 and for which the underlying mass distribution is dynamically possible. We should 
still aggressively explore all other classes of models that can also fit the observations but 
yet which produce disjoint estimates for the time delay. The true uncertainty in the 
Hubble constant is given by the union of all of these models. This is a large task. 
Clearly, we may have to introduce some practical limitations. It is probably quite safe to 
reject dynamically unreasonable mass distributions, for example positive radial gradients 
in surface density, but are we allowed to posit isolated concentrations of unseen mass? 
Of course, only time will tell whether there are genuinely dark galaxies, but until we can 
demonstrate that all significant perturbers are luminous we must allow for this possibility. 

There is already an indication that there may be more unseen mass than we have 
already included in the discovery that the incidence of quad sources relative to doubles 
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is much greater than one might have expected if the lenses are about as elliptical as 
normal galaxies (King & Browne 1996). Three explanations have been advanced for this 
discrepancy. Firstly, the total mass associated with the observed lensing galaxies may be 
highly elliptical, or, indeed, quite irregular. (It appears that observed faint galaxies have 
a median ellipticity of ~ 0.4 as opposed to ~ 0.15 for local galaxies.) Secondly, many 
of the lenses may actually have unseen companions so that the combined potential will 
generally be quite elliptical. Thirdly, as we discuss below, propagation effects associated 
dark matter long the line of sight may actually promote the formation of quads. 



4.4. The double quasar 0957+561 

We now briefly turn from quads to the important case of 0957+561, the only gravitational 
lens where the time delay has been measured with high accuracy (Kundic et al. 1996). 
Traditionally, 0957+561 was thought to be a difficult system to model because of the 
small number of constraints in its two-image configuration and because of the complexity 
of the lensing potential - the primary lensing galaxy Gl is surrounded by a cluster. To 
a large extent, these problems have been resolved in the extensive theoretical study of 
Grogin & Narayan (1996). The crucial set of constraints in the GN model was provided 
by high spatial resolution VLBI mapping of Garrett et al. (1994), which resolved inner 
jets in both images of the source into five centers of emission. Mappin g of o ne jet into the 
other fixed the relative magnification tensor of images A and B (Eq. |2.16 ), as well as its 



gradients along and perpendicular to the jet. This, in turn, provided a tight constraint 
on the radial mass profile (dM/dr) at the image locations, which, together with the total 
enclosed mass M(< r), controls the conversion factor between the time delay and the 
physical distance to the lens. The importance of the dM/dr term was nicely illustrated 
by Wambsganss & Paczyhski (1994). 

The remaining model degeneracy in the 0957+561 system between the lensing galaxy 
Gl and its host cluster cannot be removed by using relative image positions and mag- 
nifications (Falco, Gorenstein & Shapiro 1985). It has to be broken by either directly 
measuring the mass of Gl via its velocity dispersion, or by measuring the surface density 
in the cluster from its weak shearing effect on the images of background galaxies (e.g. 
Tyson, Valdes & Wenk 1990, Kaiser & Squires 1993). If both of these parameters are 
independently determined, they provide an important consistency check on the model. 

Falco et al. (1996, private communication) recently observed Gl with the LRIS spec- 
trograph at the Keck. Their high signal-to-noise spectrum yields a line-of-sight velocity 
dispersion roughly in the range tx b s = 275 ± 30 km/s (2a), consistent with the only 
published measurement by Rhee (1991), er b s = 303 ± 50 km/s. Using a deep CFHT 
image of the field, Fischer et al. (1996) mapped the cluster mass distribution from the 
distortion of faint background galaxies. Adopting their estimate of the mean back- 
ground galaxy redshift (z\, = 1.2) gives the dimensionless cluster surface mass density 
K = S/S c = 0.18 ± 0.11 (2cr). Using these two results and their measurement of the time 
delay, Kundic et al. (1996) find 

H = 67tn kms_1 M P C_1 ( 4 - 29 ) 



0.82 



"±8 



( hns-'Mpc- 1 . (4.30) 

V275kms- 1 / 1 v ; 



The uncertainty in Hq is quoted at 2a and allows for finite aperture effects and anisotropy 
of stellar orbits in conversion of er b s to Gl mass (GN). While this estimate of Hq can 
certainly be improved by future observations, it is already 10% accurate at the la level. 
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5. Systematic errors 

There are a variety of possible systematic errors which afflict gravitational lens deter- 
minations of the Hubble constant. The first of these, which we have already mentioned, 
is the degeneracy between H and a uniform density sheet in the lens plane. For example, 
suppose that there is a uniform density circular disk of matter covering all the images. 
This will act like a simple (Gaussian) converging lens and bring the same rays that would 
have met at the observer to a common focus closer to the lens. In other words, the po- 
tential variation is quadratic and so the contribution to the gravitational delay is also 
quadratic, just like the geometrical delay, from which it is therefore indistinguishable. It 
has been argued that, as any sheet must have positive mass density, we can only set an 
upper bound on the Hubble constant (Falco et al. 1985). 

A second uncertainty is associated with the choice of cosmographic world model. Let 
us suppose that the universe is of homogeneous Friedmann-Robertson- Walker (FRW) 
type so that it is parametrized by the current density parameter Slo- The angular di- 
ameter distances to high redshift sources and lenses depend quite sensitively upon SIq. 
However, the combination DdD s / D^s, relevant for the inferred Hubble constant, is rela- 
tively insensitive. For example, in the 0957+561 system, K = 39.9/i -1 days arcsec -2 for 
an Einstein-de Sitter universe with Slo = 1, and K = 42.9/i -1 days arcsec -2 for an open 
universe with Slo = 0.1. If we take an additional step and introduce a cosmological con- 
stant while keeping the universe flat, the change in K is even smaller. For < SIa < 1, 
K attains a maximum of 41.7/i -1 days arcsec -2 in the Slo = 0.25, SIa = 0.75 model. In a 
lens system with higher redshifts, however, the choice of cosmological model is more im- 
portant. We can thus use high-redshift lenses to constrain the density parameter Slo once 
Ho has been measured. For example, in the z s = 3.62 lensed BAL quasar B1422+231, 
the difference in K between Slo = 1 and Slo = 0.1 models amounts to 24% (assuming the 
lens redshift of Zd = 0.65, as reported by Hammer et al. 1995). 

Now let us turn to inhomogeneous cosmological models. If the universe has an overall 
mean density sufficient to allow it to follow FRW dynamics on the average but with this 
mass confined to small concentrated lumps, none of which intersect the line of sight, 
then individual ray congruences will be subject to zero convergence as they cannot pass 
through any matter. In this case, the angular diameter distance must be changed to the 
affine parameter (Press & Gunn 1973). If, for example, the mass in an Einstein-De Sitter 
universe were concentrated in large, distant lumps, then K would change by over 10% 
for B1422+231. However, this assumption is quite unrealistic, because the combined 
tidal influence of these lumps must, on the average, reproduce the cosmography of a 
homogeneous universe, and, in fact, it docs. In what follows we assume that our line of 
sight is not special in this sense and that individual dark masses are small enough that 
we can treat them as smoothly distributed. 

The next type of systematic error is starting to be taken more seriously and has broader 
implications. This is the error introduced in the measurement of Hq due to large-scale 
structure distributed along the line of sight from the source to the observer. A simple and 
standard description of large-scale structure in the universe is to form the power spectrum 
of relative density fluctuations, P(k), that have supposedly grown from perturbations 
similar to those that we observe in the microwave background fluctuations. The short 
wavelength perturbations [k > k m ~ (10 Mpc) -1 ] have grown to non-linear strength; the 
long wavelength perturbations should still be in the linear regime with P(k) oc k. In the 
linear regime the potential fluctuations therefore satisfy (5$ oc k~ 2 8p cx k~ 2 [k 3 P(k)} 1 ^ 2 ~ 
constant. In fact, the potential fluctuation is not just constant with linear scale but also 
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with time and has a value <5$ ~ 3 x 10 5 c 2 as normalized to the fluctuations measured 



The perturbations that are most important for perturbing gravitational lens images 
are those for which k 2 P(k) is maximized, i.e. with k ~ k m ~ (10 Mpc) -1 . As typical 
angular diameter distances arc D <~ 1 Gpc, these subtend angles ~ 30', much larger 
than the strong lensing regions, and we expect to see N <~ k m D <~ 100 perturbations, of 
both signs, along the line of sight. If we consider a single ray passing through a roughly 
spherical perturbation, it will be deflected through an angle 5a ~ <5$/c 2 and as there are 
N such deflections adding stochastically, the total deflection will 59 ~ N x / 2 5a ~ 1'. (It is 
amusing that although that there has been so much trouble taken to point HST to a small 
fraction of a pixel, the universe has a pointing error of about the width of WFPC chip!) 
In addition, each fluctuation will introduce a propagation time fluctuation 5&/k m c 3 ~ 
(5&/c 2 )(D /Nc). These fluctuations add stochastically giving a total time difference 
relative to a homogeneous universe of St ~ ( < 5$/c 2 )(D/7V 1 / 2 c) ~ 10 4 yr. However, none 
of this matters because there is no way to detect the deflection or time delay of a single 
ray. 

The situation changes when we consider two rays separated by a small angle 9 (Bland- 
ford & Jaroszyhski 1981). On passing though a single perturbation, the angular sepa- 
ration 9 will change by an angle ~ k m (D9)(5&/c 2 ). Again, summing the effects of TV 
such fluctuations stochastically gives an estimate of the total fractional change in 9, a 
measure of the strength of both the magnification and the shear fluctuations. We obtain 



These fluctuations should be coherent over angles ~ (l/k m D) ~ 30' on the sky. There 
have been attempts to measure this signal using the images of faint galaxies (e.g. Mould 
et al. 1994). 

There is a subtlety when we consider the observable time fluctuations. If the sep- 
aration of a pair of rays 9 is much less than the angular scale of the fluctuation, we 
can Taylor expand the potential. We have already dismissed the constant term and 
might expect the linear term to be observable. However, this is not the case as can 
be seen by noting that its effect simply deflects both rays through the same angle, just 
like a prism, and does not increase the optical path at all. We have to expand <E> to 
quadratic order across the rays to obtain a total time difference due to the fluctuations 



of St ~ (5<i>/c 2 )(D/N^ 2 c)(k m 9D) 2 ~ (5<5>/c 2 )N 3 / 2 (D9 2 /c). Now suppose that these two 



rays are associated with a common source and observer in a gravitational lens. The time 
delay in an otherwise homogeneous universe would be t ~ D9 2 /c and so the relative 
change in the arrival time will satisfy 



as above. Therefore to order of magnitude, we find that a few percent error in the derived 
value of the Hubble constant will be introduced by the effects of large-scale structure and 
the effect deserves computing seriously. 

This problem has been addressed by Seljak (1994) and Bar-Kana (1996), following 
earlier papers by Kovner (1987) and Narayan (1991). They find three distinct effects: 

(a) The images of the source, their positions and their shapes, are subject to a global 
shear transformation. This effect will generally be absorbed in the quadratic contribution 
to shear from the large-scale distribution of mass in the deflector plane. 

(b) The images of the deflecting galaxies, their positions relative to the images of the 



by COBE. 




(5.31) 




(5.32) 
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source and their shapes will be subject to an additional shear. Naturally this shear is 
only caused by fluctuations between us and the deflector. This effect means that there 
is a cosmic uncertainty in the image positions which must be taken into account when 
we make the models. 

(c) There will be quadratic potential fluctuations acting on the unperturbed rays that 
cause stochastic gravitational delays that create additional direct changes in the measured 
lags. These will translate directly into errors in H . There is an interesting observational 
possibility here. We can contemplate performing deep redshift surveys in the vicinity of 
the most promising lenses and rather than treat these effects as random errors, try to 
remove them directly. 

6. Is a ten percent measurement of Hq attainable? 

In order for a gravitational lens measurement of H to be of primary importance, 
it is probably going to be necessary for it to have an accuracy ~ 10 percent. Is this 
realistically attainable? Firstly, we can say that it is possible to measure the time delay 
to better than seven percent accuracy as demonstrated by the optical observations of 
0957+561. This should be repeatable in other selected sources. However, accuracies this 
good have yet to be demonstrated at radio wavelengths, partly because variations over 
the expected time delays have smaller amplitude. It is prudent to take into account such 
factors as the expected delay (neither too short nor too long), the signal-to- noise ratio in 
individual flux measurements, and a measured structure function before deciding which 
few sources to monitor regularly. Simulations should be used for deciding how frequently 
to measure the flux. 

Secondly, we need to be able to model the lenses with a total fractional uncertainty 
in the predicted time delay also below ~ 7 percent. Here, there are two challenges. As 
we have discussed, the model itself must be constrained so well that we are confident 
that there are no other models of the deflector that recover all the observables to within 
the measurement errors and yet predict seriously different delays. In addition, we must 
convince ourselves that the perturbative effects of the large-scale structure along the line 
of sight do not influence our result at this level. 

There is a plethora of alternative schemes to measure H , and each one of them has 
enthusiastic advocates. In this competitive environment, all methods carry a burden of 
proof and must demonstrate a reliability and reproducibility if they are to become widely 
accepted. In the particular case of gravitational lenses, this means that we must derive 
consistent values of the Hubble constant in several, probably at least four, cases. It is 
also probably necessary for the models to be specified and analyzed prior to measuring 
the time delay. The considerations outlined above suggest that this goal is attainable 
and, as the gravitational lens method is directly physical and free from the calibration 
uncertainties that bedevil most other methods it is well worth the observational effort to 
carry out this program. 
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